The Power of Adaptivity in Identifying Statistical Alternatives

نویسندگان

  • Kevin G. Jamieson
  • Daniel Haas
  • Benjamin Recht
چکیده

This paper studies the trade-off between two different kinds of pure exploration: breadth versus depth. We focus on the most biased coin problem, asking how many total coin flips are required to identify a “heavy” coin from an infinite bag containing both “heavy” coins with mean ✓ 1 2 (0, 1), and “light" coins with mean ✓ 0 2 (0, ✓ 1 ), where heavy coins are drawn from the bag with proportion ↵ 2 (0, 1/2). When ↵, ✓ 0 , ✓ 1 are unknown, the key difficulty of this problem lies in distinguishing whether the two kinds of coins have very similar means, or whether heavy coins are just extremely rare. While existing solutions to this problem require some prior knowledge of the parameters ✓ 0 , ✓ 1 ,↵, we propose an adaptive algorithm that requires no such knowledge yet still obtains near-optimal sample complexity guarantees. In contrast, we provide a lower bound showing that non-adaptive strategies require at least quadratically more samples. In characterizing this gap between adaptive and nonadaptive strategies, we make connections to anomaly detection and prove lower bounds on the sample complexity of differentiating between a single parametric distribution and a mixture of two such distributions.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Adaptivity and Computation-Statistics Tradeoffs for Kernel and Distance based High Dimensional Two Sample Testing

Nonparametric two sample testing is a decision theoretic problem that involves identifying differences between two random variables without making parametric assumptions about their underlying distributions. We refer to the most common settings as mean difference alternatives (MDA), for testing differences only in first moments, and general difference alternatives (GDA), which is about testing ...

متن کامل

ON THE POWER FUNCTION OF THE LRT AGAINST ONE-SIDED AND TWO-SIDED ALTERNATIVES IN BIVARIATE NORMAL DISTRIBUTION

This paper addresses the problem of testing simple hypotheses about the mean of a bivariate normal distribution with identity covariance matrix against restricted alternatives. The LRTs and their power functions for such types of hypotheses are derived. Furthermore, through some elementary calculus, it is shown that the power function of the LRT satisfies certain monotonicity and symmetry p...

متن کامل

A Conditional Test for Exponentiality Against Weibull DFR Alternatives Based on Censored‎ ‎Samples

‎A conditional test based on quadratic form using type-2 censored sample for testing exponentiality against Weibull alternative is proposed‎. ‎The simulated percentage points and powers are given‎. ‎The proposed test performs well for identifying Weibull DFR alternative even for small sample‎. ‎An example is also given.

متن کامل

Environmental Planning for Wind Power Plant Site Selection using a Fuzzy PROMETHEE-Based Outranking Method in Geographical Information System

Selection of suitable sites for wind power plants is one of the most important decision on wind resources development. Site selection for the establishment of large wind power plants requires spatial evaluation taking technical, economic, and environmental considerations into account. This study has applied a combination of PROMETHEE and Fuzzy AHP methods in a geographical information system en...

متن کامل

Learning with Limited Rounds of Adaptivity: Coin Tossing, Multi-Armed Bandits, and Ranking from Pairwise Comparisons

In many learning settings, active/adaptive querying is possible, but the number of rounds of adaptivity is limited. We study the relationship between query complexity and adaptivity in identifying the k most biased coins among a set of n coins with unknown biases. This problem is a common abstraction of many well-studied problems, including the problem of identifying the k best arms in a stocha...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2016